A Graphical Test to Examine Local Self-Similarity for Heavy-Tailed Distributions
Authors
Abstract
The Pareto distribution, or power-law distribution, has long been used to model phenomena in many fields, including wildfire sizes, earthquake seismic moments and stock price changes. Recent observations have brought the fit of the Pareto into question, however, particularly in the upper tail where it often overestimates the frequency of the largest events. This paper proposes a graphical self-similarity test specifically designed to assess whether a Pareto distribution fits better than a tapered Pareto or another heavy-tailed alternative. Unlike some model selection methods, this graphical test provides the advantage of highlighting where the model fits well and where it breaks down. Specifically, for data that seem to be better modeled by the tapered Pareto or other alternatives, the test assesses the degree of local self-similarity at each value where the test is computed. The basic properties of the graphical test and its implementation are discussed, and applications of the test to seismological, wildfire, and financial data are considered.
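The contrast between the two models can be illustrated with a minimal sketch. It is not the paper's graphical test. It assumes the standard survival functions, S(x) = (a/x)^β for the Pareto and S(x) = (a/x)^β exp((a − x)/θ) for the tapered Pareto, and the function names and parameter values are hypothetical, chosen only for illustration. Self-similarity here means that the ratio S(cx)/S(x) does not depend on x. That holds exactly for the Pareto and fails for the tapered Pareto once x approaches the taper scale θ.

```python
import math


def pareto_sf(x, a, beta):
    """Survival function P(X > x) of a Pareto with lower cutoff a and shape beta."""
    if x < a:
        return 1.0
    return (a / x) ** beta


def tapered_pareto_sf(x, a, beta, theta):
    """Survival function of a tapered Pareto: a Pareto tail times an
    exponential taper with scale theta (standard form, assumed here)."""
    if x < a:
        return 1.0
    return (a / x) ** beta * math.exp((a - x) / theta)


def scale_ratio(sf, x, c):
    """Ratio S(c*x) / S(x); constant in x when the distribution is self-similar."""
    return sf(c * x) / sf(x)


# Hypothetical parameter values, for illustration only.
a, beta, theta = 1.0, 1.0, 100.0

for x in (2.0, 10.0, 100.0, 1000.0):
    r_pareto = scale_ratio(lambda t: pareto_sf(t, a, beta), x, 2.0)
    r_tapered = scale_ratio(lambda t: tapered_pareto_sf(t, a, beta, theta), x, 2.0)
    print(f"x={x:8.1f}  Pareto ratio={r_pareto:.4f}  tapered ratio={r_tapered:.4e}")
```

In this sketch the Pareto ratio equals 2^(−β) at every x, while the tapered ratio falls away from that value as x grows. This matches the abstract's point that the Pareto tends to overestimate the frequency of the largest events.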
Similar resources
Accuracy and Computational Efficiency on the Fractal Traffic Generation
The use of synthetic self-similar traffic in computer network simulation is vital for capturing and reproducing actual Internet data traffic behavior. A commonly used technique for generating self-similar traffic is to aggregate On/Off sources whose active (On) and idle (Off) periods exhibit heavy-tailed distributions. This work analyzes the balance between a...
Heavy-tailed Probability Distributions in the World Wide Web
The explosion of the World Wide Web as a medium for information dissemination has made it important to understand its characteristics, in particular the distribution of its file sizes. This paper presents evidence that a number of file size distributions in the Web exhibit heavy tails, including files requested by users, files transmitted through the network, transmission durations of files, and files stor...
On the relationship between file sizes, transport protocols, and self-similar network traffic
Recent measurements of local-area and wide-area traffic have shown that network traffic exhibits variability at a wide range of scales. In this paper, we examine a mechanism that gives rise to self-similar network traffic and present some of its performance implications. The mechanism we study is the transfer of files or messages whose size is drawn from a heavy-tailed distribution. First, we s...
On the relationship between file sizes, transport protocols, and self-similar network traffic
Recent measurements of local-area and wide-area traffic have shown that network traffic exhibits variability at a wide range of scales (self-similarity). In this paper, we examine a mechanism that gives rise to self-similar network traffic and present some of its performance implications. The mechanism we study is the transfer of files or messages whose size is drawn from a heavy-tailed distribution. We ...
Inference with Multivariate Heavy-Tails in Linear Models
Heavy-tailed distributions naturally occur in many real life problems. Unfortunately, it is typically not possible to compute inference in closed-form in graphical models which involve such heavy-tailed distributions. In this work, we propose a novel simple linear graphical model for independent latent random variables, called linear characteristic model (LCM), defined in the characteristic fun...